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EXCERPT  FROM  THE  RESEARCH  PROPOSAL  (June  1988) 


The  characterization  of  Junin  virus  Candid  #1  genome  is 
the  primary  goal  of  the  present  research  proposal.  Modern 
nucleotide  sequencing  technology  provides  a  means  of  precise 
description  of  any  genome.  However,  the  sequence  analysis  of 
Candid  #1  alone  will  not  provide  any  clue  to  the  mechanisms  of 
virulence  attenuation.  Therefore,  it  will  be  necessary  to 
compare  it  to  sequences  of  more  virulent  ancestors  in  the 
passage  history  leading  from  prototype  Junin  virus  to  Candid  #1 
vaccine. 

The  availability  of  these  genealogically  related  Junin 
virus  variants  (i.e.  Candid  #1,  XJ#44,  XJ#13)  permits  a  proper 
approach  to  the  issue  of  the  molecular  basis  of  arenavirus 
virulence. 

This  project  focuses  mainly  on  the  changes  in  the  8  RNA 
species,  since  a  plausible  working  hypothesis  is  that  changes 
in  tissue  tropism  affecting  neurovirulence  could  possibly  be 
related  to  changes  in  the  viral  surface  glycoproteins  encoded 
in  this  RNA  segment.  Further  progress  of  this  research  will 
provide  nucleotide  and  aminoacid  sequence  data  pointing  at 
molecular  changes  that  will  hopefully  correlate  with 
differences  in  virulence.  Highly  variable  regions  will  also  be 
highlighted. 

The  comparison  of  the  sequences  of  the  "avirulent"  Candid 
#1  to  more  than  one  "virulent"  ancestor  will  also  allow  for  an 
estimation  of  the  genome  stability  or  the  probability  of  rever¬ 
sion  to  the  "virulent"  phenotype.  This  is  a  question  relevant 
to  the  theoretical  estimation  of  the  risk  of  using  Candid  #1  as 
a  vaccine,  although  the  phenotypic  stability  of  it  has  been 
already  assessed  empirically. 

In  addition,  the  molecular  analysis  of  the  viral  nucleic 
acid  can  be  used  as  a  part  of  a  thorough  quality  control  of 
vaccine  preparations. 
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In  the  frame  of  the  extended  AHF  vaccine  program,  a 
careful  evaluation  of  the  clinical  and  subclinical  AHF  cases 
must  include  the  genotypic  identification  of  the  viruses 
isolated  from  the  patients  in  the  endemic  area.  Nucleic  acid 
sequence  analysis  is  the  unambiguous  piece  of  data  that  can 
tell  us  whether  the  disease  is  caused  by  an  indigenous  virus  or 
a  virus  derived  by  mutation  from  the  vaccine  strain.  The 
amplification  of  nucleic  acids  using  the  polymerase  chain 
reaction  (PCR)  could  be  employed  for  this  purpose  based  on  the 
nucleotide  sequence  data  generated  in  this  project.  This 
particular  technology  is  being  tested  in  order  to  evaluate  its 
potential  as  a  laboratory  diagnostic  assay. 

In  summary,  the  present  project  shoud  be  regarded  as  a 
first  step  in  the  identification  of  regions  in  Junin  virus 
genome  related  to  a  particular  virulence  pattern.  The  other 
major  objective  is  to  provide  a  molecular  description  of  the 
AHF  vaccine  strain. 
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INTRODUCTION 


Junin  virus  is  the  etiological  agent  of  Argentine  hemorragic  fever 
(AHF).  The  clinical  symptoms  of  AHF  include  hematologic,  neurologic, 
cardiovascular,  renal  and  immunologic  alterations.  The  mortality  rate  may 
be  as  high  as  30%,  but  early  treatment  with  immune  plasma  reduces  the  fatal 
cases  to  less  than  2%.  (Maiztegui  et  al . ,  1979;  Enria  et  al . ,  1986). 

A  wide  variation  in  annual  number  of  cases  occurs,  ranging  from  100 
to  4000.  Since  its  recognition  in  the  early  1950s,  the  disease  has  spread 
from  an  area  of  16.000  km2  to  an  area  greater  than  120.000  km2  of  the 
richest  farming  land  in  Argentina,  with  a  population  of  ca.  2  million 
inhabitants  (Maiztegui  et  al . ,  1986).  The  major  rodent  hosts  for  Junin 
virus  are  Calomys  muscul inus  and  Calomys  laucha  (Maiztegui,  1975),  although 
infection  has  been  detected  on  occasion  in  Mus  musculus ,  Akodon  azarae  and 
Orizomys  flavescens  (Sabattini  et  al . ,  1977).  The  human  population  at  risk 
is  composed  mainly  by  field  workers.  Humans  are  believed  to  become  infected 
through  cuts  or  skin  abrasions  or  through  airborne  dust  contaminated  with 
urine  and  saliva  of  infected  rodents.  A  high  degree  of  variation  in 
virulence  and  clinical  patterns  has  been  reported  for  different  isolates  of 
Junin  virus  (Maiztegui,  1975;  Me  Kee  et  al . ,  1985;  Kenyon  et  al . ,  1988). 

A  collaborative  effort  conducted  by  US  and  Argentine  Governments  led 
to  the  production  of  a  live,  attenuated  Junin  virus  vaccine  (Peters  et  al . , 
1987).  After  rigorous  biological  testing  in  rhesus  monkeys  (Me  Kee  et  al . , 
1984),  the  highly  attenuated  Junin  virus  variant  named  Candid  #1  was  used 
in  human  volunteers,  followed  by  an  extensive  clinical  trial  in  the  AHF 
endemic  area  (Maiztegui  et  al . ,  1990). 

Our  interests  are  directed  towards  the  characterization  of  the 
vaccine  strain  at  the  molecular  level.  In  addition,  the  availability  of 
several  genealogically  related  Junin  virus  variants  with  an  increasing 
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degree  of  attenuation  (e.g.  XJ  #13,  XJ  #44  and  Candid  #1),  derived  in  the 
process  of  developing  the  AHF  vaccine,  permits  a  systematic  approach  to  the 
issue  of  the  molecular  basis  of  Junin  virus  virulence. 

In  the  last  few  years  a  reasonable  amount  of  information  has  been 
accumulating  related  to  the  genome  structure  of  some  members  of  the 
Arenaviridae  family,  to  which  the  Junin  virus  also  belongs. 

All  the  arenaviruses  share  morphologic  and  biochemical  properties; 
they  are  enveloped  and  their  genome  consists  of  two  single-stranded  RNA 
species,  designated  L  (for  large,  ca.  7  kb)  and  S  (for  small,  ca.  3.5  kb). 
The  L  RNA  codes  for  two  proteins:  a  large  L  polypeptide,  presumed  to  be  the 
RNA  polymerase,  and  a  small  zinc  finger-like  protein  (Salvato  and 
Shimomaye,  1989;  Salvato  et  al . ,  1989;  lapalucci  et  al . ,  1989a,  1989b).  The 
S  RNAs  of  several  arenaviruses  have  been  sequenced  and  shown  to  have 
several  common  features.  The  nucleocapsid  protein  gene  N  and  the  precursor 
of  the  envelope  glycoproteins  GPC  are  encoded  in  an  ambisense  manner 
(Bishop,  1988). 

The  N  protein  (ca.  63  kDa)  is  translated  from  a  viral -complementary 
or  anti -genome-sense  mRNA  species  complementary  to  the  3'  half  of  the  viral 
S  RNA  (Auperin  et  al . .  1984a,  1984b;  Romanowski  and  Bishop,  1985;  Clegg  and 
Oram,  1985;  Franze-Fernandez  et  a/.,  1987).  The  GPC  protein  (ca.  57  kDa  in 
the  unglycosylated  form)  is  translated  from  a  viral  or  genome-sense  mRNA 
corresponding  to  the  5'  half  of  the  viral  S  RNA  (Auperin  et  a/.,  1984c; 
Romanowski  et  al . ,  1985;  Auperin  et  al . ,  1986;  Franze-Fernandez  et  al . , 
1987). 

Recently,  the  complete  nucleotide  sequence  of  Junin  virus  S  RNA  has 
been  determined  in  our  laboratory.  The  information  has  been  obtained  from 
cDNA  clones  derived  from  the  MC2  strain,  a  wild  type  Junin  virus  of 
intermediate  virulence  (Ghi ringhel 1  i  et  al.,  1989;  Ghiringhelli  et  al., 
submitted).  These  data  were  instrumental  for  the  molecular  characterization 


7 


of  the  attenuated  variant  of  Junin  virus  named  Candid  #1.  Here  we  report 
the  complete  sequence  of  the  structural  glycoproteins  of  the  attenuated 
Junin  virus  vaccine  strain,  Candid  #1,  and  their  relationship  to  Junin 
virus  MC2  and  other  closely  and  distantly  related  arenaviruses. 


MATERIALS  and  METHODS 

Virus  and  cell  lines 

The  Junin  virus  vaccine  strain,  Candid  #1  was  kindly  provided  by  Drs. 
J.I.  Maiztegui  and  A.M.  Ambrosio  from  the  INEVH  (Pergamino,  Argentina)  and 
propagated  in  certified  fetal  rhesus  lung  diploid  cells  (FRhL-2)  at  passage 
levels  19  to  28.  A  "working  stock"  of  Junin  Candid  #1  virus  was  produced  by 
infection  of  FRhL-2  cell  monolayers  with  the  master  seed  (Barrera  Oro  ARG 
8/009,  14.8.81)  at  an  m.o.i.  of  0.01  PFU/cell.  The  supernatant  media  were 
harvested  96  h  post  infection,  aliquoted  and  stored  at  -70°C.  This  virus 
stock  was  used  as  inoculum  to  produce  virus  particles  for  biochemical 
studies. 

Preparation  of  viral  RNA 

When  60%  confluent,  FRhl-2  cell  monolayers  were  infected  with  a 
multiplicity  of  infection  of  0.1  pfu/cell  and  incubated  at  37°C  in  Eagle's 
minimal  essential  medium  with  non-essential  aminoacids  (MEM)  and  10%  fetal 
calf  serum.  The  virus  was  recovered  and  purified  from  the  supernatant  media 
on  day  4  post - i nfec t i on  (Grau  et  al . ,  1981,  Rosas  et  al . ,  1988). 

Virions  pelleted  by  ultracentrifugation  were  disrupted  with 
guanidinium  thiocyanate  and  the  RNA  was  isolated  essentially  according  to 
the  procedure  of  Chonczynski  and  Sacchi  (1987).  The  RNA  pellet  was 
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resuspended  in  0.5%  Sarkosyl,  phenol  extracted  and  reprecipitated  with  Q.3M 
sodium  acetate  and  2.5  volumes  of  ethanol.  The  final  pellet  was  resuspended 
in  water  and  stored  in  aliquotes  at  -70°C. 

The  RNA  was  analyzed  by  agarose  gel  electrophoresis  in  the  presence 
of  methyl  mercury ( 1 1 )  hydroxide  as  denaturant  (Bailey  and  Davidson,  1976). 

Synthesis  and  cloning  of  cDNA 

Synthetic  oligonucleotides  complementary  to  nucleotides  1-19,  846-862 
and  2300-2316  (nucleotide  numbers  start  at  the  3'  end)  of  Junin  virus  MC2  S 
RNA  (Ghiringhel 1 i  et  al . ,  1989,  1990)  were  phosphoryl ated  and  used  to  prime 
cDNA  synthesis.  The  reaction  containing  viral  RNA,  primers,  deoxynucleoside 
triphosphates  and  AMV  reverse  transcriptase  was  carried  out  in  a  way 
similar  to  that  described  by  Gubler  and  Hoffman  (1983). 

The  double  stranded  cDNA  was  blunt-ended  using  T4  DNA  polymerase, 
size-fractionated  by  gel  electrophoresis  and  glass  powder  elution  to 
prevent  the  generation  of  a  high  proportion  of  clones  with  short  inserts. 

The  ds  cDNAs  in  the  range  of  1.0  -  4.0  kbp  were  inserted  by  ligation  in  the 
Smal  site  of  pBS  SIC  (Bluescript  SIC,  Stratagene,  La  Jolla,  CA)  and  cloned 
in  E.  coli  DH5  F'a.  The  colonies  containing  pBS  SK+  with  inserts  were 
selected  by  color  on  LB-agar  plates  containing  ampicillin,  IPTG  and  X-gal. 

Alternatively,  Bst  XI  adapters  were  ligated  to  the  ds  cDNA  prior  to 
the  size- fractionation  step,  and  the  cDNA  was  inserted  in  the  Bst  XI  site 
of  the  pcDNA  II  vector  ( Invi trogen,  San  Diego,  CA). 

The  screening  of  the  recombinant  clones  was  performed  by  colony 
hibridization  (Grunstein  and  Hogness,  1975)  using  Junin-MC2  S  RNA  probes. 
The  probes  consisted  of  selected  DNA  fragments  that  were  obtained  from 
recombinant  plasmids  containing  Junin-MC2  sequences  and  labeled  by  nick 
translation.  Plasmid  DNA  was  prepared  from  the  hybridization  positive 
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clones,  analyzed  by  gel  electrophoresis  after  restriction  enzyme  digestion 
and  tested  by  Northern  blot  (Maniatis  et  al . ,  1982). 

Northern  blot  analysis 

Viral  and  cellular  RNAs  were  denatured  with  lOmM  CHjHgOH  and 
electrophoresed  on  a  1%  agarose  gel  according  to  Bailey  and  Davidson 
(1976). The  gel  was  soaked  in  14  mM  B-mercaptoethanol  and  0.5  ^tg/ml  ethidium 
bromide,  to  check  the  quality  of  the  RNA  preparation,  and  transferred  by 
capillary  blotting  onto  a  Gene  Screen  nylon  membrane  (NEN,  Chicago,  1L, 
USA).  The  blot  was  processed  according  to  the  manufacturers  specifications 
and  hybridized  to  different  cDNA  probes.  The  32P  labelled  cDNA  probes  were 
prepared  by  nick  translation  (Maniatis  et  al . ,  1982)  or  mul tipriming 
(Feinberg  and  Vogelstein,  1983)  of  the  recombinant  plasmid  DMAs.  Northern 
blots  were  employed  to  assess  the  identity  of  the  cDNA  clones  that  were 
further  analyzed  by  nucleotide  sequencing. 

Sequence  analysis 

The  nucleotide  sequences  of  the  cDNA  inserts  were  determined  by  the 
dideoxy  method  of  Sanger  et  al .  (  1977)  using  a  modified  T7  DNA  polymerase 
(Sequenasem,  USB,  Cleveland,  OH,  USA).  The  original  cDNA  inserts  were 
subcloned  into  M 1 3  mpl8  and  mpl9  and  the  ssDNAs  were  used  as  templates  for 
sequencing  (Yan i sh- Perron  et  al . ,  1985). 

The  sequence  information  was  processed  on  a  personal  computer  using 
the  DNASIS  (Hitachi  Software  Engineering  Co.)  program  and  on  a  VAX  computer 
using  the  package  by  Genetics  Computer  Group  (GCG,  Madison,  WI,  USA). 
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RESULTS 


Holecular  cloning  and  nucleotide  sequence 
of  Jum'n-Candid  #1  virus  S  RNA 

Although  three  different  oligonucleotides  were  used  simultaneously 
(primers  A,  B  and  C,  Fig.  1),  the  hybridization-positive  clones  that  were 
recovered  contained  cDNA  inserts  beginning  with  only  two  of  them.  So  far, 
no  cDNA  clones  were  obtained  starting  with  the  oligonucleotide-primer 
complementary  to  the  3'  terminal  sequence  conserved  among  all  the 
arenavirus  S  RNAs  ( 5 ' CGCACAGT GGATCCTAGGC ,  Auperin  et  al . ,  1982).  The  cDNA 
clones  were  shown  to  anneal  specifically  to  Junin  virus  S  RNA  in  Northern 
blot  hybridizations  (data  not  shown). 

The  nucleotide  sequences  of  several  overlapping  cDNA  clones  shown  in 
Fig.  1  were  determined  using  the  dideoxy  chain  termination  method  (Sanger 
et  a?.,  1977).  In  this  analysis  all  the  overlapping  sequences  were 
identical.  The  nucleotide  sequence  presented  in  Fig.  2  comprises  the  5' 
non-coding  region,  the  GPC  gene,  the  intergenic  region  and  the  terminus  of 
the  N  gene. 

The  open  reading  frame  (ORF)  of  viral  polarity  starts  at  positions 
89-91  and  stops  at  1544-1546  and  codes  for  the  GPC  glycoprotein  precursor. 

Junin  virus  (Candid  #1)  glycoprotein  precursor  GPC 

The  first  AUG  codon  of  the  genomic  or  viral -sense  (v)  S  RNA,  found  at 
nucleotides  89-51,  initiates  a  long  ORF  that  remains  open  down  to  the  in- 
phase  UAA  translational  termination  signal  at  positions  1544-1546.  Seven 
codons  downstream  from  the  first  AUG,  a  second  AUG  is  found  in  the  same 
frame.  However,  based  upon  the  observations  on  most  frequent  flanking 
sequences  of  eukaryotic  translation  start  codons  reported  by  Kozak  (1984) 
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(i.e.  CC^CCAUGG),  the  first  AUG  is  considered  to  function  as  the  true 
initiation  codon  since  a  G  residue  is  found  at  positions  +4  and  -3,  and 
there  is  also  a  C  at  position  -2.  The  molecular  weight  calculated  for  the 
485  amino  acids  long  translation  product  (i.e.  55,700  Da)  is  smaller  than 
the  value  estimated  by  gel  electrophoresis  for  the  intracellular 
glycoprotein  precursor  that  undergoes  cotranslational  glycosylation  (De 
Mitri  and  Martinez  Segovia,  1985;  Rustici,  1984). 

Eight  potential  N-glycosyl ation  sites  are  found  in  the  GPC  sequence 
(Fig.  3).  Whether  all  are  used  for  the  attachment  of  carbohydrate  side 
chains  is  not  known. 

Amino  acid  sequence  homology  of  arenavirus  GPC  gene  products 

Junin  virus  GPC  protein  sequence  was  aligned  with  the  rest  of  the 
arenavirus  GPCs  as  depicted  in  Fig.  4.  The  best-fit  alignment  shows  that 
the  amino  acid  sequence  homology  is  concentrated  in  two  regions:  the  53 
residues  at  the  amino-terminus  and  the  carboxy-proximal  half  of  GPC. 
Buchmeier  et  al .  (1987)  demonstrated  that,  in  the  infected  cell,  LCM  virus 
GPC  is  cleaved  at  or  near  the  Arg262  -Arg263  to  yield  the  proteins  G1  (amino- 
terminal  half)  and  G2  (carboxy-terminal  half).  By  analogy,  Junin  virus  GPC 
should  be  cleaved  at  or  near  the  homologous  Arg243  -Arg244  (MC2)  or  Arg247- 
Arg24g  (Candid  #1 ) . 

The  G1  and  G2  regions  of  GPC  defined  by  the  dibasic  amino  acid 
sequence  at  positions  243  to  286  —depending  on  the  virus —  exhibit  quite 
different  degrees  of  sequence  homology.  G1  sequences  show  few  clusters  of 
sequence  conservation  among  all  the  different  arenaviruses.  These 
homologous  regions  concentrate  especially  at  the  positions  1  to  53  and  84 
to  97. 
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This  sequence  comparison  shows  that  Junin  and  Tacaribe  G1  sequences 
are  by  far  more  homologous  (50%)  than  the  other  pairs  analyzed  (Table  1). 
The  sequence  similarity  is  even  higher  for  the  G2  polypeptide,  where  82%  of 
the  amino  acids  are  identical  in  nature  and  position.  Comparison  of  Junin 
G2  with  the  other  arenavirus  G2  proteins  also  shows  a  high  degree  of 
homology,  in  the  range  of  53-57%,  suggesting  that  there  is  less  sequence 
flexibility  than  in  Gl,  compatible  with  function  conservation.  The  homology 
figures  are  higher  when  conservative  amino  acid  changes  are  considered  in 
the  calculation  (Table  2). 

Four  of  the  putative  N-glycosyl ation  sites  of  GPC  are  conserved  in 
all  the  arenaviruses  (one  in  the  Gl  region  and  three  in  the  G2,  Fig.  4). 
This  fact,  however,  is  not  necessarily  linked  with  the  actual  use  of  these 
or  the  other  less  conserved  glycosylation  sites. 

Comparison  of  the  glycoprotein  genes 
of  Junin  virus  strains  Candid  #1  and  HC2 

The  alignment  of  the  coding  sequences  of  MC2  and  Candid  #1  GPC  genes 
shows  an  overall  high  degree  of  homology  (Fig.  4).  Although,  all  the 
nucleotide  sequence  alterations  are  summarized  in  Table  3,  there  are  major 
changes  that  will  be  pointed  out  here.  There  are  several  nucleotide 
insertions  and  deletions  concentrated  between  the  codons  43  and  76  of 
Junin-MC2  GPC  gene  that  should  be  formally  considered  to  modify  its 
sequence  to  that  of  Candid  #1  GPC  (codons  43-80,  Figs.  3  and  5). 

There  are  six  single-nucleotide  insertions  that  change  MC2  amino  acid 
sequence  43CSILD47  into  43LFQFFVF49  in  Candid  #1.  Two  triplets  downstream, 
there  is  a  six-nucleotide-block  insertion  resulting  in  the  insertion  of  two 
amino  acids  (LA,  codons  52  y  53).  Further  downstream,  one  transversion  and 
three  single-nucleotide  insertions  change  the  amino  acid  sequence  54PR55  to 
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to  58TEE6Q.  The  other  major  change  results  from  three  different  single¬ 
nucleotide  deletions  that  shift  the  reading  frame  of  the  coding  sequence 
converting  the  MC2  sequence  65VPDCVLLQWWVS76  to  70FQTVSFSMVGL80. 

Most  of  the  other  nucleotide  substitutions  do  not  change  the  coding 
with  the  exception  of  codons  139,  423  and  442  in  MC2  or  corresponding 
codons  143,  427  and  446  in  Candid  #1,  that  result  in  three  amino  acid 
changes,  two  of  which  are  conservative  according  to  the  considerations  of 
Schwartz  and  Dayhoff  (1979). 

As  shown  in  Fig.  6,  the  distribution  of  the  amino  acid  sequence 
changes  in  MC2  and  Candid  #1  GPC  gene  products  concentrates  in  the  amino- 
proximal  region  of  G1  in  contrast  with  the  changes  in  the  nucleotide 
sequence  that  are  scattered  through  the  carboxyl -terminus  of  G2.  However, 
most  of  the  nucleotide  substitutions  in  G2  do  not  alter  the  amino  acid 
sequence. 

None  of  the  amino  acid  residues  that  are  different  in  both  Junin 
virus  strains  alters  any  of  the  eight  potential  N-glycosyl ation  sites. 

Non  coding  nucleotide  sequences 

The  untranslated  upstream  sequence  for  Jum'n-Candid  #1  virus  GPC  gene 
comprises  88  nucleotides,  86  of  which  are  identical  to  the  homologous 
region  in  Junin-MC2  S  RNA.  The  two  only  differences  that  were  found  consist 
of  an  extra  cytosine  at  the  5'  end  of  Candid  #1  RNA  and  the  substitution  of 
an  A  for  a  G  at  position  38. 

The  non-coding  region  dowstream  from  the  translation  termination 
signals  for  both  GPC  and  N  genes  (nucleotides  1544-1546  and  1638-1640  of  v 
S  RNA)  contains  two  sets  of  self-complementary  nucleotide  sequences  capable 
of  forming  two  very  stable  hairpin-loop  structures  (Fig.  7). 
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The  hairpin  closest  to  the  GPC  stop  codon  is  formed  by  11  GC  and  2  AU 
pairs  (AG°=  -39.0  kcal/mol,  Tinoco  et  al . ,  1973)  and  the  one  closest  to  the 
N  stop  codon  is  stabilized  by  15  GC  and  1  AU  pairs  (AG°  =  -57.2  kcal/mol. 
The  nucleotide  sequences  of  Junin-MC2  and  Junin-Candid  #1  are  identical  but 
for  three  nucleotides.  Moreover,  two  of  the  nucleotide  changes  (C-U,  A-*C) 
fall  outside  the  double  hairpin-loop,  and  the  third  one  (U-C)  does  not 
affect  the  thermodynamic  stability  of  the  secondary  structure  either,  since 
it  is  located  at  the  top  of  the  first  hairpin-loop  (Fig.  7).  One  hairpin- 
loop  structure  has  been  found  in  the  intergenic  region  of  each  of  the 
arenaviruses  that  have  been  sequenced  to  date  (Auperin  et  al . ,  1984; 
Romanowski  and  Bishop,  1985;  Auperin  et  al . ,  1986;  Franze-Fernandez  et  al . , 
1987).  In  this  context  Junin  virus  S  RNA  has  an  unusual  secondary 
structure. 


DISCUSSION 

Changes  in  the  amino  acid  sequence  of  the  glycoprotein  G1  might  be 
responsible  for  the  attenuated  phenotype  of  Jum'n-Candid  #1  vaccine  strain 

We  completed  the  nucleotide  sequence  of  the  glycoprotein  precursor 
gene  of  Junin-Candid  H 1  virus  and  its  flanking  untranslated  regions. 

During  these  studies,  special  attention  was  devoted  to  avoid  spurious 
genetic  variations  that  could  possibly  obscure  the  changes  relevant  to  the 
attenuation  of  virulence.  Taking  into  account  that  arenaviruses  are 
notorious  for  variations  in  their  biologic  properties  based  on  passage 
history  (Ahmed  et  al . ,  1984),  the  biochemical  studies  were  conducted  on 

v;rions  that  had  the  same  manipulations  as  the  vaccine  stock.  Junin  (Candid 
#1)  virus  was  passaged  in  diploid  FRhL-2  cells  the  same  limited  number  of 
times  as  the  vaccine  in  order  to  avoid  genotypic  variations  that  could 
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arise  from  accumulation  of  mutations  upon  repeated  passage  or  a  different 
type  of  selection  posed  on  the  virus  if  a  different  cell  substrate  was 
used. 

The  information  obtained  from  cDNA  clones  of  the  attenuated  Junin 
virus  vaccine  has  been  compared  with  the  homologous  regions  of  Junin-MC2 
and  other  arenavirus  S  RNAs  and  their  gene  products. 

The  untranslated  regions  flanking  the  GPC  genes  of  different 
arenaviruses  are  variable  in  length,  sequence  and  secondary  structure. 
However,  when  the  highly  attenuated  Junin  virus  strain,  Candid  §1,  is 
compared  with  the  MC2  strain,  of  intermediate  virulence,  the  similarities 
are  notorious.  The  differences  consist  of  an  extra  C  at  the  5'  end  of 
Candid  #1  and  a  G  in  position  38  instead  of  the  A  present  in  MC2.  The 
secondary  structure  in  the  intergenic  region  downstream  of  the  GPC  and  N 
open  reading  frames  of  Candid  #1  is  identical  to  that  of  MC2  with  only 
three  nucleotide  changes,  that  do  not  affect  the  stability  of  the  double 
hairpin-loop. 

In  poliovirus  type  3,  one  single  base  mutation  in  the  5'  non  coding 
region  C472-»T472  has  been  demonstrated  to  be  responsible  of  the  attenuated 
phenotype  of  the  vaccine  strain  P3/Leon  12a, b.  On  the  other  hand,  another 
mutation  in  the  VP3  coding  sequence  that  changed  a  serine  to  a 
phenylalanine  codon  was  also  shown  to  contribute  to  the  attenuation  of  the 
neurovirulence  (Westrop  et  al . ,  1986) 

In  Junin-Candid  #1  it  cannot  be  demonstrated  directly  whether  the 
nucleotide  substitutions  in  the  non  coding  region  play  a  role  in 
attenuation  of  the  virulence.  However,  the  striking  differences  in  the  GPC 
coding  sequence  of  Candid  §1  and  MC2  suggest  that  these  might  be  more 
relevant  to  the  changes  in  virulence  and  tissue  tropism. 

The  nucleotide  sequence  changes  found  in  the  GPC  ORF  when  Candid  #1 
and  MC2  S  RNAs  are  compared,  concentrate  in  the  NH2-proximal  (20  nucleotide 
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changes)  and  the  COOH-proximal  regions  (19  nucleotide  substitutions). 
However,  from  Fig.  6  it  becomes  apparent  that  the  major  amino  acid  residue 
changes  resulting  from  the  above  mentioned  nucleotide  sequence  alterations 
are  located  in  the  NH2-proximal  region  of  GPC.  This  part  of  the  coding 
sequence  corresponds  to  the  G1  protein,  which  — by  analogy  with  LCM  (Burns 
et  al . ,  1990) —  is  thought  to  be  exposed  on  the  surface  of  the  virion,  in 
association  with  the  more  internal  G2  protein. 

The  amino  acid  sequence  changes  occur  in  a  region  that  contains  the 
proposed  signal  peptidase  cleavage  site  (Burns  et  al . ,  1990).  Therefore, 
the  altered  G1  amino  acid  sequence  would  correspond  to  the  actual  amino- 
terminus  of  this  protein.  By  contrast,  only  two  isolated  amino  acid  residue 
substitutions  are  found  in  the  G2  protein  of  Candid  #1,  despite  the  fact 
that  the  number  of  nucleotide  changes  in  this  region  of  the  genome  is 
similar  to  the  one  in  the  G1  sequence. 

Although  our  results  that  suggest  the  involvement  of  the  surface 
glycoprotein  in  the  attenuation  of  virulence  are  preliminary,  !.hey  are 
consistent  with  the  reports  on  other  viruses. 

It  should  be  noted,  however,  that  for  each  attenuated  vaccine  strain 
analyzed  (poliovirus  types  1  and  3  and  yellow  fever)  suggestive  or 
experimental  evidence  has  been  obtained  for  a  role  of  altered  viral  surface 
protein  in  attenuation  (Nomoto  et  al . ,  1982;  Hahn  et  al . ,  1987).  As 
suggested  by  the  literature  on  different  viruses  the  amino  acid 
substitution  can  have  profound  effects  on  virulence  and  tissue  tropism 
(Spriggs  and  Fields,  1982;  Spriggs  et  al . ,  1983;  Coulon  et  al . ,  1983; 
Johnson  et  al.,  1986;  Prehaud  et  a/.,  1990). 

Significant  progress  has  been  made  in  understanding  the  genetic  basis 
of  attenuation  of  the  type  1  poliovirus  vaccine  strain  using  the 
recombinant  viruses  obtained  by  in  vitro  manipulation  of  infectious  cDNA 
(Nomoto  et  a/.,  1987;  Racaniello  and  Baltimore,  1981).  This  approach  has 
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proved  to  be  very  productive  because  the  changes  at  the  nucleotide  level 
can  be  precisely  directed  and  the  resulting  phenotypes  analyzed  for  changes 
in  the  biological  properties.  Unfortunately,  at  this  time,  it  is  impossible 
to  regenerate  infectious  viruses  from  cDNA  of  segmented  RNA  genomes. 
Therefore,  we  will  have  to  resort  to  the  more  classical  approach  of 
analyzing  the  genomes  of  spontaneously  generated  mutants  with  different 
virulence  phenotypes.  As  a  short  cut  to  this  end,  the  variants  derived  in 
the  process  of  development  of  the  Candid  #1  strain  will  be  analyzed  as 
mentioned  elsewhere  in  this  report. 

In  particular,  the  literature  on  viruses  such  as  paramyxo  and 
orthomyxoviruses  indicates  that  tissue  tropism  and  virulence  in  a 
particular  host  are  dependent  upon  the  proteolytic  activation  of  a  surface 
glycoprotein  (Tashiro  et  al . ,  1990;  Deshpande  et  al . ,  1987).  Although,  the 
cleavability  of  the  arenavirus  GPC  has  not  been  studied  in  relation  to  the 
infection  process,  we  have  not  found  any  amino  acid  sequence  changes  at  or 
around  the  cleavage  site  of  Junin  Candid  #1  GPC.  Therefore,  it  might  be 
concluded  that  this  region  plays  no  role  in  the  attenuation  of  Candid  #1. 

The  biological  significance  of  the  nucleotide  changes  found  in  the 
attenuated  and  wild  type  strains  of  Junin  virus  may  be  provisionally  sought 
by  evaluation  of  the  secondary  structures  predicted  for  the  RNAs  and  the 
proteins.  At  this  time,  we  have  only  compared  the  sequences  of  a  highly 
attenuated  Junin  virus  with  the  data  obtained  in  our  laboratory  from  the 
cDNA  of  a  wild  type  virus  isolate. 

Although  these  findings  are  strongly  suggestive  of  the  genetic 
changes  that  lead  to  an  attenuated  phenotype,  they  must  be  complemented 
with  studies  on  the  direct  predecessors  of  Candid  #1,  that  exhibit 
gradually  more  virulent  phenotypes,  in  order  to  locate  precisely  the 
mutations  responsible  for  attenuation. 
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Finally,  the  identification  of  a  highly  variable  region  in  the  S  RNA 
of  Junin  virus  provides  a  valuable  marker  for  monitoring  the  stability  of 
this  virus  or  the  identification  of  different  virus  isolates  recovered  from 
AHF  patients. 

Confirmation  of  an  unusual  secondary  structure  in  the  intergenic  region 

The  nucleotide  sequence  of  Junin-MC2  virus  S  RNA  showed  a  potential 
secondary  structure  in  the  intergenic  region  consisting  of  two  consecutive 
hairpin-loops  (Ghiringhel 1 i  et  al . ,  1987).  This  unusual  structure  differed 
from  the  single  hairpin-loop  found  in  other  arenavirus  S  RNAs  but  was 
confirmed  by  sequence  analyses  of  several  independently  generated  cDNA 
clones  (Ghiringhel 1 i  et  al . ,  submitted  for  publication). 

A  structure  identical  to  the  one  described  for  MC2  S  RNA  has  now  been 
determined  in  the  vaccine  strain  Candid  #1.  It  is  interesting  to  note  that, 
while  this  report  was  in  preparation,  a  new  corrected  version  of  the 
Tacaribe  virus  S  RNA,  that  was  made  available  to  us  (M.T.  Franze-Fernandez, 
personal  communication),  also  showed  two  potential  hairpin-loop  structures 
in  the  intergenic  region.  Although,  the  nucleotide  sequence  that  forms  the 
double  hairpin-loop  in  Tacaribe  S  RNA  is  quite  similar  to  the  homologous 
stretch  in  Junin  virus,  the  complete  intergenic  region  is  much  longer. 

Another  difference  of  Junin  virus  with  respect  to  the  rest  of  the 
arenaviruses  is  located  in  the  very  5'  end  sequence.  The  5'  termini  of 
Pichinde,  Lassa,  LCM  and  Tacaribe  virus  S  RNAs  have  been  reported  to 
contain  the  conserved  sequence  5'  CGCACCGGGGAUCCUAGGC,  with  an  extra  G  at 
the  5'  end  in  the  case  of  Pichinde  and  Lassa  (Auperin  et  al . ,  1984c; 
Romanowski  et  al . ,  1985;  Auperin  et  al . ,  1986;  Franze-Fernandez  et  a/., 
1987).  The  same  portion  of  the  5'  end  sequence  of  Junin-MC2  virus  is 
slightly  different  (i.e.  UGCAGUAAGGGGAGCCUAGGC ;  the  differences  are 
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underlined)  but  the  base-pairing  with  the  3'  end  of  the  S  RNA  yields  a 
panhandle  structure  similar  to  the  one  predicted  for  the  rest  of  the 
arenavi ruses .  The  sequence  found  in  Junin-MC2  has  been  confirmed  in 
independent  clones  of  Junin-Candid  #1  cDNA,  although  an  extra  C  was 
determined  at  the  5'  end. 

These  findings  suggest  that  the  non  coding  regions  of  the  S  RNAs 
could  be  used  to  characterize  closely  related  arenaviruses  as  belonging  to 
the  same  or  different  virus  "species". 
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FIGURE  1:  Cloning  and  sequencing  strategy  of  Junfn  virus  (Candid  #1)  S  RNA. 
The  S  RfJA  nucleotide  sequence  is  represented  in  the  viral  sense  with  a  size 
scale  in  kilobases  above.  Some  of  the  overlapping  clones  spanning  75%  of 
the  S  RNA,  are  shown  as  rectangles.  Phosphoryl ated  synthetic  oligonu¬ 
cleotides  (A,  B  and  C)  that  were  used  to  prime  cDNA  synthesis  arp  indicated 
as  black  boxes  at  the  beginning  of  each  clone.  Subcloning  into  M13  mpl8  and 
mp!9,  using  the  restriction  sites  that  are  indicated,  was  performed  in 
order  to  generate  the  nucleotide  sequence  data  by  the  method  of  Sanger  et 
al.  (1977).  The  arrows  indicate  the  length  of  the  sequences  that  were  ob¬ 
tained  using  a  modified  T7  DNA  polymerase  (Sequenase1M,  USB)  on  different 
Ml 3  subclones. 
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CUGCAGUAAG  GGGAUCCUAG  GCGAUUUUGG  UAACGCUGUA  AGUUGUUACU  GCUUUCUAUU  UGGACAACAU  CAAACCAUCU  80 
AUUGUACAfcij~GfjGGCAAUUC  AUCAGCUUCA  UGCAAGAAAU  ACCUACCUUU  UUGCAGGAAG  CUCUGAAUAU  UGCUCUUGUU  160 
GCAGUCAGUC  OCAUUGCCAU  CAUUAAGGGU  GUAGUAAACC  UGUACAAAAG  UGGUUUGUUC  CAAUUCUUUG  UAUUCCUAGC  240 

ACUCGCAGGA  AGAUCCUGCA  CAGAAGAAGC  UUUUAAAAUC  GGACUGCACA  CAGAGUUCCA  GACUGUGUCC  UUCUCAAUGG  320 

UGGGUCUCUU  UUCCAACAAU  CCACAUGACC  UGCCUCUGUU  CUGUACCUUA  AACAAGAGCC  AUCUUUACAU  UAAGGGGGGC  400 

AAUGCUUCAU  UCAAGAUCAG  ClAJUGAUGAC  AUCGCAGUGU  UCUUACCAGA  AUAUGACGUU  AUAAUUCAGC  AUCCGGCACA  480 

UAUGAGCUGG  UGUUCOAAAA  GUGAUGAUCA  AAUUUGGCUG  UCUCAGUGGU  UCAUGAAUGC  UGUGGGGCAU  GAUUGGUAUC  560 
UAGACCCACC  AUUUCUGUGU  AGGAACCGUA  CAAAGACAGA  AGGCUUCAUC  UUUCAAGUCA  AUACCUCCAA  GACUGGUAUC  6*0 
AAUGAAAACU  AUGCCAAGAA  GUUJAAGACU  GGUAUGCACC  AUUUAUAUAG  AGAAUACCCC  GACUCUUGCU  UGGAUCGCAA  720 
ACUGUGUUUG  AUGAAGGCAC  AACCCACCAG  UUGGCCUCUC  CAAUGUCCAC  UUGACCAUGU  CAACACAUUA  CAUUUCCUCA  800 
CAAGAGGCAA  GAACAUUCAG  CLAJCCAAGGA  GGUCUUUAAA  AGCAUUCUUU  UCCUGGUCUC  UGACAGACUC  AUCCGGCAAG  880 
GACACCCCUG  GAGGCUAUUG  UCUAGAAGAG  UGGAUGCUCG  UUGCACCCAA  AAUGAAGUGU  UUUGGCAAUA  CUGCUGUAGC  960 
AAAAUGCAAU  CUGAAUCAUG  ACUCUGAAUU  v.  GUGACAUG  CUGAGGCUUU  UUGAUUACAA  CAAAAAUGCU  AUCAAAACCU  1040 
UAAAUGAUGA  AACUAAGAAA  CAAGUAAAUC  UGAUGGGACA  GACAAUCAAU  GCGCUGAUAU  CUCACAAUUU  AUUGAUGAAA  1120 
AACAAAAUUA  GGGAACUGAU  GAGUGUCCCU  UACUGCAAUU  ACACAAAAUU  UUGGUAUGUC  AACCACACAC  UUUCAGGACA  1200 
ACACUCAUUA  CCAAGGUGCU  GGUUAAUAAA  AAACAACAGC  UAUUUGAACA  UCUCUGACUU  CCGUAAUGAC  UGGAUAUUAG  1230 

AAAGUGACUU  CUUAAUUUCU  GAAAUGCUAA  GCAAAGAGUA  UUCGGACAGG  CAGGGUAAAA  CUCCUUUGAC  UUUAGUUGAC  1360 

AUCUGUAUUU  GGAGCACAGU  AUUCUUCACA  GCGUCACUCU  UCCUUCACUU  GGUGGGUAUA  CCCUCCCACA  GACACAUCAG  1440 

GGGCGAAGCA  UGCCCUUUGC  CACACAGGUU  GAACAGCUUG  GGUGGUUGCA  GAUGUGGUAA  GUACCCCAAU  CUAAAGAAAC  1520 

CAACAGUUUG  GCGUAGAGGA  CAC^JAAjGACC  UCCUGAGGGU  CCCCACCAGC  CCGGGCACUG  CCCGGGCUGG  UGUGGCCCCC  1600 

CAGUCCGCGG  CCUGGCCGCG  GACUGGGGAG  GCACUGCjOUAj  CAGUGCAUAG  GCUGCCUUCG  GGAGGAACAG  CAAGCUCGGU  1680 

GGUAAUAGAG  GUGUAGGUUC  CUCCUCAUAG  AGCUUCCCAU  CUAGCACUCA  CUGAAACAUU  AUGCAGUCUA  GCAGAGCACA  1760 

GUGUGGUUr.A  CUGGAGGCCA  ACUUGAAGGG  AGUAUCCUUU  UCCCUCUUUU  UCUUAUUGAC  AACCACUCCA  UUGUGAUAUU  1840 

UGCAUAAGUG  ACCAUAUUUC  UCCCAGACCU  GUUGAUCAAA  CUGCCUGGCU  UGUUCAGAUG  UGAGCUUAAC  AUCAACCAGU  1920 

UUAAGAUCUC  UUCUUCCAUG  GAGGC'CAAAC  AACUUCCUGA  UGUCAUCGGA  UCCUUGACUA  GUCACAACCA  UGUCUGGAGG  2000 

CACCAAGCCG  AUCACGUAAC  UAAGAACUCC  UGGCAUGCAU  CUUCUAUGUC  CUUCAUUAAG  AUGCCGUGAG  AGUGUCUGCU  2080 

ACCAUUUUUA  AACCCUUUCU  CAUCAUGUGG  UUUUCUGAAG  CAGUGAAGUC  UGCUU  2135 


FIGURE  2:  Parcial  nucleotide  sequence  of  Junin  virus  (Candid  HI)  5  RNA 
A  2135-nucleotide-long  sequence  S  RNA  is  presented  in  the  viral  or 
genome  sense.  1  he  translational  initiation  and  termination  codons  of 
GRC  and  the  termination  anticodon  of  N  (encoded  in  the  viral -comple¬ 
mentary  sequence)  are  boxed.  Two  inverted  repeats  at  the  interyenic 
region  are  underlined. 
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1 

61 

121 

181 

241 

301 

361 

421 

481 


a  b  c 

MGQ  F I S  FMQ  E I PT  F  LQE  A  LN I A  LV  AVS  LI  A 1 1  KG  VVN  LY  KS  G  LFQFFVF  LA  LAGRS  CTEE 

-CSILD-  ==  PR- 


_ _ 

afkiglhtefqtvsfsmvglfsnnphdlpllcti|nks|hlyikgg|nas|fkisfddiavllp 

VPDCVLLQWWVS 


e  _ 

eydviiqhpadmswcsksddqiwlsqwfmnavghdwyldppflcr^rt|ktegfifqv|nts 

R 

KTGINENYAKKFKTGMHHLYREYPDSCLDGKLCLMKAQPTSWPLQCPLDHVNTLHFLTRG 

KNIQLPRRSLKAFFSWSLTDSSGKDTPGGYCLEEWMLVAAKMKCFGNTAVAKCNLNHDSE 


FCDMLRLFDYNKNAIKTLNDETKKQVNLMGQTINALISDNLLMKNKIRELMSVPYCNYTK 


FW 


LSGQHS  LPRCWLI 


kJnnsJyl^is 


DFRNDWILESDFLISEMLSKEYSDRQGKTPL 


f  g 

TLVDICIWSTVFFTASLFLHLVGIPSHRHIRGEACPLPHRLNSLGGCRCGKYPNLKKPTV 
F  T 


WRRGH 


FIGURE  3:  Amino  acid  sequence  of  Jurn'n  virus  Candid  #1  GPC  gene  products. 
The  regions  were  Candid  $1  GPC  sequence  differs  from  that  of  the  wt  MC 2 
strain  are  identified  by  letters  a  through  g  (for  easy  cross-reference  with 
table  3)  and  underlined.  The  amino  acid  sequence  of  the  MC2  GPC  protein  in 
these  regions  appears  below  the  lines).  The  putative  N-glycosyl ation  sites 
are  boxed.  Two  dots  (yy)  highlight  the  double  basic  amino  acid  sequence 
identified  by  Buchmeier  et  al .  (1987)  as  the  putative  proteolytic  cleavage 
site  that  generates  GI  (NH?-terminal  half)  and  G2  (C00H- terminal  half  of 
GPC). 
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J/C01  SIVFFTASLFLHLVGIPSHRHIRGfACPtPrtll  — KSlGGCRCGICr-PHLKKPTVWRRGH  540 


US 
l  CM 
J/MC2 
J/C01 
TAC 
PIC 


-KQPGV§VA§  | 

-kvpgvk|  j  |: 


t(H'T|vM^ltmg 


FIGURE  4:  Comparison  of  arenavirus  GPC  proteins.  The  predicted  amino  acid  sequences  of 
the  GPC  proteins  of  Lassa  (LAS,  Auperin  et  al . ,  1986),  LCM-WE  (ICM,  Romanowski,  e(  a/., 
1985),  Tacaribe  (TAC,  Franze-Fern4ndez  et  al . ,  1987),  Pichind6  (PIC,  Auperin  et  al . , 
1984c)  and  Junin-MC2  (J/MC2,  Ghlringhelli  et  al.,  submitted)  are  compared  to  that  of 
Junf n-Candid  #1  (J/C01)  GPC. 

The  amino  acid  residues  identical  to  those  of  J/CD1  GPC  protein  are  indicated  by  black 
boxes;  those  that  represent  conservative  changes  according  to  Schwartz  and  Dayhoff 
(1979)  appear  as  shaded  areas.  Gaps  were  Incorporated  in  the  sequences  in  order  to 
obtain  the  best  fit  alignment.  The  complete  amino  acid  sequence  of  J/CD1  GPC  protein, 
that  was  used  as  reference,  is  printed  above  the  aligned  sequences.  In  addition,  the 
conserved  N-glycosylatfon  sites  (Nx|)  are  indicated  by  thick  lines  above  the  JUN  GPC 
sequence.  The  position  of  the  proteolytic  cleavage  site  Is  indicated  by  a  triangle 
(the  amino  acid  sequence  to  the  left  of  this  site  is  referred  in  the  text  as  G1  and 
that,  to  the  right,  as  G2). 


2k 


nucleotide  number 


1  60  120  180  240  300  360  420  480 


arninoacid  number 


FIGURE  6:  Distribution  of  the  amino  acid  and  nucleotide  changes  in  the  GPC 
protein,  the  corresponding  open  reading  frame  and  the  5'  and  3'  untrans¬ 
lated  regions.  The  nucleotide  and  amino  acid  sequences  of  Candid  #1  and  MC? 
strains,  aligned  as  in  Fig.  5  and  3,  were  subdivided  in  fractions  of  30  nu¬ 
cleotides  and  10  amino  acids,  respectively,  in  order  to  score  the  differ¬ 
ences  found  while  scanning  the  sequences.  The  upper  panel  is  a  histogram- 
plot  of  the  nucleotide  changes  (%)  calculated  from  the  pairwise  comparison 
of  the  homologous  30-nuclectide  fragments  of  Candid  #1  and  MC2  S  RNAs.  The 
boundaries  of  the  GPC  ORF  are  indicated  by  two  vertical  lines,  that  con¬ 
tinue  into  the  lower  panel.  In  this  panel,  the  changes  (%)  in  the  amino 
acid  sequence  of  the  10-amino-acid-residue  fractions  of  Candid  #1  and  III 2 
GPC  proteins  were  plotted. 
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FIGURE  7:  Intergenic  region  of  Junin  virus  S  RNA.  A  portion  of  the  S  RNA  of 
Junin-Candid  #1  (A)  virus  comprising  the  ends  of  the  GPC  and  N  genes  (sense 
and  anti -sense  strand,  respectively)  and  the  non-coding  intergenic  region 
is  shown  in  the  figure.  The  directions  of  translation  are  indicated  by 
arrows  and  the  stop  codon  for  the  GPC  protein  and  the  stop  anticodon  for 
the  N  gene  are  marked  with  asterisks.  Two  potential  hairpin-loop  structures 
are  stabilized  by  13  and  17  base  pairs,  respectively  (AG°  =  -39.0  and  -54.2 
kcal/mole).  In  the  lower  part  of  this  figure  (B),  the  homologous  region  of 
Junin-MC2  virus  S  RNA  is  shown  for  comparison.  The  nucleotides  C1541,  A1544 
and  T]56S  of  MC2  have  been  changed  to  T15S4.  G1  557  and  C]S78  in  Candid  #1. 
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TABLE  2 


SEQUENCE  HOMOLOGY  (%)  OF  ARENAVIRUS  GPC  PROTEIN, 
AMINO  ACID  IDENTITY  AND  CONSERVATIVE  SUBSTITUTIONS 


J/MC2  TAC  PIC  LCM  US 

J/CD1 

92  74  43  42  42 

J/MC2 

69  43  42  44 

TAC 

41  38  36 

PIC 

41  47 

LCM 

44 

J/MC2  TAC  PIC  LCM  LAS 


89 

71 

71 

69 

92 

72 

70 

68 

70 

69 

67 

68 

69 

83 

G1 


G2 


AMINOACID  SIMILARITY  (Schwartz  and  Dayhoff,  1979) 


A 

— 

G; 

P 

M 

= 

V; 

F; 

L;  1 

C 

= 

S; 

Y 

N 

= 

D; 

E; 

H 

D 

E; 

G; 

N; 

Q 

P 

= 

A 

E 

= 

G; 

N; 

Q; 

D 

Q 

= 

E; 

D; 

H 

F 

= 

1; 

L; 

M; 

W; 

Y 

R 

= 

W; 

H; 

K 

G 

= 

S; 

D; 

A; 

E 

S 

= 

C; 

G 

H 

= 

N; 

Q; 

R 

T 

= 

1 

= 

L; 

M; 

V; 

F 

V 

= 

1; 

L; 

M 

K 

= 

R 

W 

= 

Y; 

L; 

F;  R 

L 

= 

1; 

V; 

W; 

M; 

F 

Y 

C; 

F; 

W 

A 

=  AJa 

F 

=  Phe 

K 

=  Lys 

P 

-  Pro 

T  = 

Thr 

C 

-  Cys 

G 

=  Gly 

L 

=  Leu 

Q 

=  Gin 

V  = 

Val 

D 

=  Asp 

H 

=  His 

M 

=  Met 

R 

=  Arg 

W  = 

Trp 

E 

=  Glu 

1 

=  lie 

N 

=  Asrt 

S 

*  Ser 

Y  = 

Tyr 
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TABLE  3 


Localization  of  nucleotide  and  amino  acid  changes  in  the  glycoprotein  genes 
of  wild  type  MC2  and  vaccine  Candidll  strains  of  Junin  virus 


position 

nucleotide 

|  codon 

amino  acid 

MC2 

CD1 

MC2 

-♦ 

CD1 

j  MC2  -  CD1 

MC2 

-4 

CDl 

1 

- 

C 

upstream 

untrans  latcd 

37 

38 

A 

-4 

G 

non  coding 

reg  ion 

214 

*216 

ins . 

U 

stepwise 

218 

*221 

ins . 

c 

change  of 

225 

*229 

ins. 

u 

reading 

226 

231 

ins . 

u 

f  rames 

CSILD 

LFQFFVFa 

228 

*234 

ins. 

u 

(2  codons 

229 

*236 

ins . 

c 

extra) 

234 

242 

1  6  nt 

.  block 

cue 

to 

to 

insert  ion 

-♦ 

LAb 

235 

247 

CUCGCA 

GCA 

■ 

stepwise 

247 

260 

C 

change  of 

248 

262 

ins. 

11' 

reading 

250 

265 

ins. 

n 

frames 

PR 

- 

TEEC 

252 

268 

ins. 

u 

( 1  codon 

H 

extra) 

stepwise 

280 

295 

del  . 

G 

change  of 

300 

314 

del  . 

U 

reading 

VPDCVLLQWWVS  -  FQTVSFSMVGl1’ 

315 

*333 

del . 

U 

f  r  ames 

(1  codon  loss) 

502 

515 

A 

-* 

u 

AGG  -  UGG 

R 

We 

1123 

1136 

U 

c 

UUG  -  CUG 

L 

-* 

L 

1239 

1252 

U 

-* 

c 

AUU  -  AUC 

I 

- 

I 

1264 

1277 

C 

-♦ 

u 

CUA  -  UUA 

L 

- 

L 

1323 

1336 

C 

-4 

u 

GGC  -  GGU 

G 

-► 

G 

1332 

1345 

C 

-* 

u 

CCC  -  ecu 

P 

-4 

P 

1354 

1367 

U 

-* 

A 

UUU  -  AUU 

F 

-4 

If 

1383 

1396 

C 

A 

UCC  -  UCA 

S 

-* 

S 

1404 

1417 

C 

-4 

U 

GGC  -  GGU 

G 

G 

1411 

1424 

A 

-* 

u 

ACC  -  UCC 

T 

-4 

S9 

1416 

1429 

U 

-4 

c 

CAU  -  CAC 

II 

-4 

II 

1419 

1432 

G 

-4 

A 

AGG  -  AGA 

R 

-4 

R 

1428 

1441 

A 

-4 

G 

CAA  -  CAG 

R 

-4 

R 

1434 

1447 

G 

-4 

A 

CGG  -  CGA 

E 

-4 

E 

1444 

1457 

C 

-* 

U 

CUG  -  UUG 

L 

- 

L 

1449 

1462 

C 

-4 

A 

CCC  -  CCA 

P 

-> 

P 

1456 

1469 

C 

-4 

U 

CUA  -  UUG 

L 

-4 

L 

1458 

1471 

A 

-4 

G 

CUA  -  UUG 

L 

-4 

1. 

1461 

1474 

U 

-4 

C 

AAU  -  AAC 

N 

-4 

N 

1521 

1534 

c 

-* 

u 

CGC  -  CGU 

R 

- 

R 

1541 

1554 

c 

-4 

u 

downstream 

untrans  latcd 

1544 

1557 

A 

-4 

G 

non  coding 

region 

1565 

1578 

U 

-4 

C 

*  In  insertions  or  deletions  in  which  4  stretch  of  the  seme  nucleotide  is  involved,  it  Is  not  possible  to  specify  the  exact  po 
sition  of  the  mutational  change.  Therefore,  the  last  nucleotide  In  the  stretch  is  indicated  to  identify  the  sequence  position 


a.b.c.d.e.f .g:  refer  to  the  regions  Indicated  in  Fig  3 
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